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The vulnerability to single-detector instrumental artifacts in standard detection methods for long- 
duration quasimonochromatic gravitational waves from nonaxisymmetric rotating neutron stars 
(‘continuous waves’, CWs) was addressed in past work [Keitel, Prix, Papa, Lead and Siddiqi, 
Phys. Rev. D 89, 064023 (2014)] by a Bayesian approach. An explicit model of persistent single¬ 
detector disturbances led to a generalized detection statistic with improved robustness against such 
artifacts. Since many strong outliers in semicoherent searches of LIGO data are caused by transient 
disturbances that last only a few hours, we extend the noise model to cover such limited-duration dis¬ 
turbances, and demonstrate increased robustness in realistic simulated data. Besides long-duration 
GWs, neutron stars could also emit transient signals which, for a limited time, also follow the GW 
signal model (tCWs). As a pragmatic alternative to specialized transient searches, we demonstrate 
how to make standard semicoherent GW searches more sensitive to transient signals. Considering 
tCWs in a single segment of a semicoherent search, Bayesian model selection yields a new detec¬ 
tion statistic that does not add significant computational cost. On simulated data, we find that it 
increases sensitivity towards tCWs, even of varying durations, while not sacrificing sensitivity to 
classical GW signals, and still being robust to transient or persistent single-detector instrumental 
artifacts. |f] 


I. INTRODUCTION 

Among the main search targets of terrestrial inter¬ 
ferometric detectors mm are continuous gravitational 
waves (CWs): periodic, narrow-band signals with a slow 
frequency evolution, emitted by rotating neutron stars 
with nonaxisymmetric deformations. mm Searches for 
CWs from unknown sources over wide parameter spaces 
are usually performed with semicoherent methods. EHn] 
For these, the data are split into several segments, each 
spanning part of the observation time. Each segment is 
analyzed coherently, and the resulting per-segment de¬ 
tection statistics are combined incoherently, e.g., by a 
sum. At fixed computational cost, semicoherent meth¬ 
ods are generally more sensitive than fully coherent 
searches. 011111] 

Even though gravitational-wave (GW) detectors are 
highly precise instruments, they still produce compli¬ 
cated data sets with many noise components. These 
are not all fully modeled by existing data analysis proce¬ 
dures, and thus result in outliers of the detection statis¬ 
tics. To distinguish noise outliers from real signals, a lot 
of work is routinely invested in detailed investigation of 
search results and auxiliary data. Some of it can be saved 
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by modifying detection methods to produce less outliers 
in the first place. [13] 

Many outliers in CW searches are caused by so- 
called lines, narrow-band disturbances that are typically 
present for a sizable fraction of the observation time. 
Such persistent lines can have diverse instrumental or en¬ 
vironmental origins, such as harmonics of the electrical 
power grid frequency, of the detector’s suspension sys¬ 
tem, or from digital components. 

A separate class of noise artifacts are transient 
‘glitches ’ [151 lasting only for a few milliseconds or 

seconds. These are mostly relevant in searches for tran¬ 
sient GWs from compact-binary coalescences and ‘burst’- 
type events. However, there is a third class of intermedi¬ 
ate ‘transient’ disturbances: they are much longer than 
glitches, so that they are highly relevant for CW searches; 
but still much shorter than the full observation time, so 
that they are not easily addressed by methods for mit¬ 
igating persistent lines. Typical time scales range from 
less than an hour to at most a few days. 

Such medium-duration transients, of a linelike quasi¬ 
monochromatic type, were noticed in a semicoherent 
search for CWs from the Galactic center with two years 
of LIGO S5 data [25U2H] . based on the matched-filter 
J--statistic [211 El] and the global-correlations (GCT) 
semicoherent search method mi Eg. In that search, 
many strong outliers could be traced back to narrow- 


^ These time scales are called ’very long’ in Ref. m, compared to 
the classical ms-s ’bursts’; but are merely ’medium’ compared 
to the months or years spanned by CW searches. 
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band disturbances in the data happening only within 
a single segment (each 11.5 hours long) of data from 
a single detector. Similar transient single-segment dis¬ 
turbances have also been found in LIGO S6 data, using 
60-hour segments for one year of data. m 

In the Galactic-center search a permanence veto was 
introduced [2HE5] to remove any candidates for which 
a single segment contributed excessively to the semico- 
herent multi-detector detection statistic. It was proven 
to be very effective, and also safe regarding classical GW 
signals, which are persistent over time scales comparable 
to the full length of the data set. [MIES] However, in 
a semicoherent CW search over several months of data, 
such a veto also suppresses nonpersistent signals with du¬ 
rations similar to a segment length, i.e., only a few hours 
or days: these would produce just the same data signa¬ 
ture as a disturbance in terms of single-segment, multi¬ 
detector statistics. Such Hransient-CW' signals (tCWs), 
following the standard GW signal model but for a limited 
duration, are also considered viable candidates for detec¬ 
tion [311 [32] , with several possible emission mechanisms 
from perturbed neutron stars [33U38] . 

Therefore, this paper investigates an alternative ap¬ 
proach to the permanence veto, constructing a detection 
statistic that is robust against single-detector transient 
artifacts, while at the same time being more sensitive 
than standard statistics to transient signals that are co¬ 
herent across multiple detectors. 

Two methods for detecting medium-duration transient 
signals have been previously proposed. One approach is 
to extend a coherent J^-statistic-based GW search to the 
case of tGWs by including their duration, start-time and 
shape as free parameters in the search grid. [Ml [32] This 
is nearly optimal in the Neyman-Pearson sense [39] . but 
computationally expensive due to the increased dimen¬ 
sionality of the search space. Alternatively, an un¬ 
modeled ‘excess power’ detection method originally used 
to search for GW bursts of at most a few seconds has 
recently been extended to cover longer durations. [23] 
While not specifically aimed at or optimized for tGWs, 
it could also be sensitive to this type of signal. Due to 
the different signal models, data processing and search 
methods of the GW-based and burst-based approaches, 
direct comparison of their tGW sensitivity is a difficult 
open question; and neither of these approaches has yet 
been used for an analysis of actual interferometer data. 

In contrast, the approach in this paper is a pragmatic 
extension of the line-robust statistics of Ref. [13] (here¬ 
after Paper I), which in turn are based on the stan¬ 


dard matched-filter J^-statistic [M] EH] . The J^-statistic 
is close to optimal as a detection statistic for persistent 
GWs in Gaussian noise uni, which in current detectors 
is a good model for the noise distribution over most of 
the observation time and frequency range. (See, e.g.. 


Refs. [IHIIISIIIII-) In fact, the J'-statistic corresponds to 
a binary hypothesis test between a GW signal hypothesis 
and a Gaussian-noise hypothesis. [ID] 

In the line-robust statistics from [Paper I| the noise 
model is extended to include persistent single-detector 
lines, without any detailed physical modeling of the lines’ 
origin: the idea is to simply model a line as identical 
to a CW signal, but confined to a single detector. We 
summarize these developments in Sec. [H] 


In Sec. Ill of this paper, the new material begins 
with a further extension of the noise model that also 
includes transient disturbances - or, more specifically, 
single-segment, single-detector disturbances. With this 
approach, CW searches now become more robust to¬ 
wards both persistent and transient single-detector dis¬ 
turbances. It can reproduce the robustness of the perma¬ 
nence veto when considering persistent CW signals only, 
while not being as strict in suppressing transient tCW 
signals. 

A second step, described in Sec. |IV[ aims to improve 
the sensitivity of semicoherent J^-statistic-based searches 
towards transient signals, hence reducing the need for 
more specialized tCW searches. We achieve this by also 
including an explicit signal model for transient CW-like 
signals on a time-scale corresponding to a single seg¬ 
ment in a semicoherent search. We then test these ex¬ 
tended detection statistics in Sec. jVj using simulated 
data with a realistic distribution of gaps in observation 


times, and conclude in Sec. VI A comparison to other 
search methods for medium-duration transient GW sig¬ 
nals [11I3I1I32] remains a topic for further work. 


II. SUMMARY OF EXISTING SEMICOHERENT 
DETECTION STATISTICS 

This section briefly summarizes previous work on 
how the matched-filter J^-statistic [211128] follows from 
Bayesian hypothesis testing [JD], on the permanence 
veto [Ml - ED] , and on Paper I s extension of the Bayesian 


See Appendix A.3 of Ref. m for computing cost estimates. 


approach to produce line-robust detection statistics. 

This section also serves as an introduction to the no¬ 
tation used in this paper. x^{t) denotes a time series 
of GW strain measured in a detector X. Following the 
multi-detector notation of [28l [42] , boldface indicates a 
multi-detector vector, i.e., x{t) is the multi-detector data 
vector with components x^{t). 

For Bayesian hypothesis testing [JH], P{H\x,X) is the 
probability of a hypothesis % given data x and prior 
information T. Posterior odds ratios between two hy¬ 
potheses 'HA^ P-B are written with an uppercase symbol 
Oaib{xj^)] if ’Hb is the logical sum of two hypotheses, 
T-Lb = {T-Lc or T-Ld), we write Oa/b{x,I) = Oaicd{x,X). 
The corresponding prior odds take a lowercase symbol, 
oa/b{^)^ and the likelihood ratio or Bayes factor is 
Ba/b{x,I), so that Oa/b{x,I) = oaib^) Ba/b{x,X). 









3 


Also, in this paper semicoherent quantities from a 
search with a number Nseg of segments carry a hat, such 
as J-. Coherent single-segment quantities have a tilde 
above the symbol and an upper index i = I... Nseg enu¬ 
merating the segments, such as 


A. The J^-statistic: signals in Gaussian noise 


We start with a Gaussian-noise hypothesis, 
'Hg ■ x{t) = n{t), with the samples of n{t) drawn 
from a Gaussian distribution. Its posterior probability, 
given priors P {%g ^ and P {x\I), is 


i^G 


x,l] = 


P 


(Hg|x) 


P{x\I) 


Ke 


-^{x\x) 


( 1 ) 


with a normalization constant k and a scalar product 
between time series defined as 


ix\y) ^ f x^{t)y^{t)dt . 


( 2 ) 


Here, are the single-sided power-spectral densities 
(PSDs), assumed as uncorrelated between different de¬ 
tectors and constant over the (narrow) frequency band 
of interest. 

The CW signal hypothesis 


Hs ■ x{t) = n{t) + X) (3) 


contains a waveform with a set A of four amplitude pa¬ 
rameters and a set A of phase-evolution parameters (in¬ 
cluding frequency, spin-down and sky position). In a 
semicoherent search, different A^ are allowed in each 
segment; but we simplify our notation by redefining 

A = {A^). 

After marginalizing over A and the associated prior 
(cf. [131 [211 Sni mi), the posterior probability is 


P 


(jds 


X,I] = 


OS/G 


{i)p{nG 


x,X] e' 




■ (4) 


Here, bs/G(T) = P (PslP) /P (A^g\P) are the prior odds 

between a signal and Gaussian noise; € (— 00 , 00 ) 

is a free parameter (the result of an arbitrary A-prior 
cutoff); and the semicoherent multi-detector J^-statistic 
is, for a single parameter-space point A, given by the sum 
over single-segment coherent J^^-statistics: 

Affeg 

.F(a;;A) = ^.F'(a:^A). (5) 

i=i 


In practice, often an interpolating StackSlide algorithm 
is used, where P{x-,X) for each A is computed from a set 
of P^{x^;X^) with the A^ picked from a coarser grid in 


parameter space than the A. [H ini US HU. in Eq. @, 
as well as for the rest of the paper, we do not explicitly 
show the A-dependence of our detection statistics. 

These posterior probabilities can be used to compute 
odds ratios between the different hypotheses. First, we 
see from Eqs. Q and Q that 


Os/g{x,X) 


p 

Ps 

x,I^ 

p 

[Pg 

x,T^ 


oi Bs/g{x,I) , (6) 


i.e., this Bayesian approach reproduces the J^-statistic as 
the Neyman-Pearson-optimal detection statistic for CW 
signals in pure Gaussian noise and under the assumed 
priors. The free parameter is irrelevant in this case. 


B. Permanence veto 


The permanence veto, as introduced in Refs. |23HlS]j 
works by the following algorithm: From a fixed Gaus¬ 
sian false-alarm level or some real-data noise studies, a 
threshold J^thr is set on the average semicoherent statis¬ 
tic P = P/Nseg- Then, for each candidate the highest 
single-segment contribution is removed, defining 

where m is the segment with the highest multi-detector 
.F™ = max£.F^. 

In the original implementation of Refs. [21U2H] , the F'pv 
value of each candidate is compared with the threshold 
J^thr to determine whether to veto the candidate. In our 
tests in Sec.jVj we slightly modify this algorithm to treat 
the permanence veto on a more equal footing with the 
other detection statistics: We define F'pv exactly as in 
Eq. but we set a detection threshold by computing 
the maximum of P^„ over a pure-noise data set. 


C. Line-robust statistics 


[Paper I] introduced a more general noise model 
including a simple noncoincident ‘line’ hypothesis 
"Hl : x^(t) = n^{t) -\- h^{t]A^), which just assumes a 
CW-like disturbance in an arbitrary single detector X. 
It leads to a line-robust detection statistic which is re¬ 
produced here with slightly updated notation. 

Marginalisation as for Eq. Q yields 


-^det 

p {nL\x,i^ = p {nG\x,i^ 




( 0 ) 


X^l 


( 8 ) 
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with the per-detector line-prior odds and their sum, 




ol/g(2^) = X!°l/g(^) ■ 


(9a) 

(9b) 


We suppress the I-dependence of any odds ratios or 
Bayes factors in Eq. Q and from now on. 

Furthermore, we can combine the (mutually exclusive) 
hypotheses Hg and "Hl into an extended noise hypothesis 
■Hgl = iP-G or "Hl), with posterior probability 


P 

= P 


(Pgl 
(Pg 


x,X\ = P 


{Pg\x,X) + P {p^\x,X) (10) 


Na. 






Finally, using Eqs. 0 and ( [To| ), we obtain generalized 
signal-versus-noise odds 


Os/gl(®) — 


Os/G 


e-7"(==) 




( 11 ) 




and, with the conditional probabilities for lines in the 
absence of a signal. 


Pl = P (Ph PGh,P'j = Y 
= P (p^\PguP) 


Ol/G 
+ Ol/G 

°L/G 


(12a) 

(12b) 


1 + Ol/G 

the corresponding Bayes factor, or line-robust statistic, is 


Ps/Ghix) — 


p.7"(x) 


(1 - pl) 


(13) 


In this statistic, the parameter determines a tran¬ 
sition scale between increased strictness to either Gaus¬ 
sian noise or lines. It can therefore be considered as a 
tuning parameter for the line-robust statistic. In sec¬ 
tion VLB of [Paper 1] it was suggested to choose the low¬ 
est that makes Bq/gl as efficient as P for simulated 
CW signals in quiet (almost-Gaussian) data, and demon¬ 
strated that this tuning choice at the same time offers 
improved robustness against lines. 

The limit of P^'^ —>■ — oo corresponds to a binary test 
of P'S, against Pl, excluding Gaussian noise. We refer to 
this Bayes factor Bs/l as the pure line-veto statistic. 


III. DERIVING A CW DETECTION STATISTIC 
THAT IS ROBUST TO SINGLE-SEGMENT 
DISTURBANCES 


Going beyond the noise model of jPaperTl we now turn 
to the issue of noncoincident transient linelike distur¬ 
bances. To address it in the same Bayesian framework as 
above, consider a new ‘transient-line’ hypothesis P^ for 
a quasiharmonic disturbance in a single segment £ and 
single detector X: 


By : x^\t) = n^\t) + h^\t-, . (14) 


This is just the full GW hypothesis from Eq. ([^ re¬ 
stricted to a subset x^^{t) of the data. Thus, in analogy 
with Eqs. Q and (|^ and dropping the time-series argu¬ 
ments again, the posterior probability for P^ is 


P 


(«y|,T«,i)=p(«y|a-”,i) 


Gl/g ® 




(0) 


(15) 


In principle, we could now build up a wide range of 
composite hypotheses about the whole data-set x, span¬ 
ning Vseg X Vdet subsets x^^(t), by combining instances 
of By and of the single-segment Gaussian-noise hypoth¬ 
esis Pq^, and by setting appropriate constraints on the 
amplitude parameters 

For example, the hypothesis Pl for persistent single¬ 
detector lines corresponds to P^ with the same A^^ 
for all £, but only for a specific detector X = Y; combined 
with Pq^ for all other detectors X . 

However, we concentrate on one specific new full-data- 
set hypothesis PtL'- for the case of a transient, single¬ 
detector disturbance in only one £ and one X, with no 
prior constraint on the values of these indices. For exam¬ 
ple, if we have data in two segments for two detectors, 
the full hypothesis is 


BtL : Pg ^G ^G ) (16) 

or ^Bq and B(l and Bq and Bq ^ 
or ^Bq and Bq and B^l and Bq ^ 
or ^Bq and Pq and Bq and B^l) . 


The full semicoherent posterior probability for this hy- 
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pothesis is then 


XZ "■ 


or 


X^^\X 


p(PG|=r,x) (17) 


xt 


Xses A^det 

introducing the shorthand notation X) = S S • 

xt i=\ x=i 

We can then produce a combined noise hypothesis 
PcLtL that allows for either pure Gaussian noise, a per¬ 
sistent line or a single-segment transient line: 


'HcLtL : (Pg or Pl or Ptn) • 


(18) 


As seen before in |Paper I 'Hj^(A^) has the same like¬ 
lihood as Pq in the special case of vanishing amplitude 
parameters, = 0. But when we obtain the full line 
hypothesis by marginalizing over A^, this is only 

a null-set contribution; furthermore, the two hypotheses 
are still, by construction, logically exclusive. The same 


reasoning applies to Hence, the probabilities of 

these three hypotheses must simply add up: 


Pi 


(PGLtL 

x,lj 

= p(Pg 

x,I^ + P (Pi^ a 

= p{Pg 

/ Xdet 

\ x=] 



Xi 


■^X T^ — N 

oh/G 


XI 

tL/G 0 


■ (19) 


Then, the odds ratio between the classical persistent- 
CW signal hypothesis Ps and the combined triple-noise 
hypothesis PGLtL yields a new detection statistic 


Os/GLtL = o^s/G 


f _ Wdet 

^ X=1 


. (20) 

X(. / 


where, just as a reminder, the semicoherent P-statistics 
are P = Yl,t and P^ = P^^. 


With the total prior disturbance odds Ol/g — X)X/G 
and OtL/G = S^^/G’ introduce the following short- 
hands for prior probabilities conditional on the composite 


noise hypothesis "Hglil, generalizing the pn and from 
Eq. pi): 




xxe 

PtL 


PLtL 


PH 


X 


'HGLth,X] = 


A/g 


1 + Ol/g + OtL/G ’ 
xxt 




'Pgul,X] = 


Al/g 


(PLtL 


PGLtL,T = 


1 + Ol/G + OtL/G 
OLtL/G 
1 + OLtL/G 


(21a) 

(21b) 

(21c) 


This allows us to write the corresponding Bayes factor as 


Ps/GUh = 


(1 — pLtn) +J2Ph + ^Pg^ 

X XI 


( 22 ) 


We see that the difference between (i) the persistent- 
line term already present in the Ps/GL of Ref. (13) and 
(ii) the newly introduced transient-line term is that we 
have either (i) a sum over X of the exponentials of a sum 
over i of P^^, or (ii) a double sum over X and t of the 
exponentials of each individual plus a large constant 
term (A^seg - l)pi°^- 

Hence, if there is a strong disturbance in a single {X, tj 
combination and if the transition-scale parameter 
has been chosen as higher than the typical P^^ in pure 
Gaussian noise (in accordance with the tuning proce¬ 
dure described in Sec. VIB of Paper I), then the lat¬ 


ter term can dominate in the denominator. This will 
make Ps/GLtL stricter in suppressing these transient dis¬ 
turbances than Ps/GL- 

We could have introduced an additional free tuning pa¬ 
rameter into Ps/GLtL by using a different cutoff on the 
AX£ pj-jQj. than for the A^ in , resulting in a 

different pi°^^ appearing. However, we already have free¬ 
dom in the relative weights of persistent and transient¬ 
line contributions through the and p^^, and there 
is no clear physical motivation in such a complication of 
the amplitude priors (which were chosen ad hoc, to repro¬ 
duce the P-statistic, in the first place, cf. Refs. EH SO])- 
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Hence, we refrain from this possibility, and use the tests 
in Sec. ^to demonstrate sufficient flexibility of Hg/cLtL 
without it. 

As the denominator of Hg/QHL is a sum of exponen¬ 
tials (or weighted exponentials, but of course the log of 
the weights can be absorbed into the exponents), it is 
often dominated by a single term. The same is true for 
-Ss/GL, and its limiting behavior in various cases was dis¬ 
cussed in Sec. IVBl of [Paper 1} Here, we just give an ex¬ 
pression for In Hg/GLtL written as a sum of the dominant 
term and a logarithmic correction. 


a modified ‘loudest-only’ detection statistic 

Bs/gulao — ((1 ~ Plil) (26) 

^det ^det _ ~ \ 

X=1 X=1 ) 

This quantity could, in principle, differ quite significantly 
from the actual Bayes factor Hg/cLtL- There is also no 
guarantee that it is as efficient a detection statistic under 
our initial hypotheses, so we will test its efficiency with 
simulated data in Sec. jV] 


In ^S/GLtL —B— bmax — In 



(23) 


where i?max = maxi? is the maximum of the set of ex¬ 
ponents, with 1 -I- A^det(l + -^seg) elements: 


IV. DERIVING A DETECTION STATISTIC 
FOR PERSISTENT OR TRANSIENT SIGNALS, 
ROBUST TO PERSISTENT OR TRANSIENT 
SINGLE-DETECTOR LINES 


V = [ VsegJ'i°^ -f In (1 - pLtn), -f InpL : 

^ ^ ^ ^ (24) 

+ (iVseg - l).Fi°^ + lnp«} . 

In computer implementations, this form is useful both 
for numerical stability (avoiding underflows) and to speed 
up computation when the correction term can be ne¬ 
glected, InHg/GLtL ^ T - D max ■ 

We also consider an intermediate step where we re¬ 
duce the ^-sum in the denominator to the highest per- 
xe 

segment contributions from each detector, but keep the 
remaining 1 -|- 2Vdet terms. This will reduce computa¬ 
tional cost while also corresponding to the initial assump¬ 
tion of a single-segment disturbance: again, because of 
the exponentials, a single significantly increased F"^^ will 
easily dominate over all others. Hence, in many cases a 
good approximation to the Bayes factor is given by 

Ss/GLtL « /((1 - pLtn) (25) 

+ Epl 

X X 

where im(X) is the segment number for which 

~Xm(X) ^XTCi(x) 

PtL ^ =max^(^pfLe )■ 

In some applications, purely for reasons of search code 
simplification and reduction of data volume, only reduced 
single-segment information may be available: the set of 
values {F"'", only for the segment m with the 

highest multi-detector F"* = max^F^. To still obtain an 
approximate version of Bg/GLtL in such cases, we define 



CW-like transient signals might be interesting search 
targets. [311 E21 EH] One might now anticipate that the 
transient-line-robust Bayes factor Fg/GLtL from Eq. (22) 
is too restrictive towards these, as a multi-detector- 
coherent signal in a single segment can increase the de¬ 
nominator of Eq. (22) more than the numerator. 

However, the approach of considering more general hy¬ 


potheses built up from the set {F^, {F'^^}} should actu¬ 
ally allow for more sensitivity towards transient signals 
than any detection statistic based only on the total semi- 


coherent results, like F and Os/GL- 

So we try to improve over Fg/GLtL by deriving another 
generalized detection statistic, answering the following 
question: how likely is any type of CW-like signal (per¬ 
sistent or transient), in comparison with Gaussian noise, 
a persistent line, or a transient line? 

Starting from the full set of single-segment 
{F^,{F^^}}, the most general answer would in¬ 
volve a large set of hypotheses for signals in different 
numbers of segments. But here we keep to the simplify¬ 
ing assumption of single-segment transients, introducing 
a transient-signal hypothesis as the multi-detector 
version of Eq. (|T4| : 


nis : x‘{t) = n\t) + h\t- X). (27) 


Note that this is different from the single-segment, single¬ 
detector transient-line hypothesis from Eq. (141 only 
if the data set for segment I contains data for at least 
two detectors X. In this section, we assume this to be 
the case for the whole data set. However, in the real 
world the components of a multi-detector network often 
have different duty factors and standard data selection 
methods [45| can result in segments with data from one 
detector only, or with negligible amounts of data from 
the other detectors. 
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We test the robustness of this detection statistic, de¬ 
rived with the assumption of full segment coverage by all 
detectors, by considering a data set with realistic duty 
factors in Sec.|Vj and discuss ways to deal with the slight 
issues it can cause in Sec. IV Cl 

Let us continue from the posterior distribution for , 
which is analogous to Eq. (15): 


P 


{Hi^\x^,l)=P{n^Gya) ofs/ce' 




(28) 


Just as for the various noise hypotheses, we can also 
add up the probabilities for the signal hypotheses Hs and 
"Hts to evaluate a more general persistent-or-transient 
‘CW-like’ hypothesis: 


P 


(^StS 


X,I\ = P 


= p 


(P-G 


x,X 


{ns\x,i'^ + P (Pts 

Wseg 


Os/G 


X,I) (31) 

’ ~e 

^tS/G ® 


e=i 


The odds ratio between generalized signal hypothesis 
and generalized noise hypothesis is then 


The hypothesis Pts for a transient signal in an arbitrary 
segment is the logical OR combination of 'Hfg analogous 
to Eq. (16), so that the posterior P (Pts is ob¬ 

tained in analogy with Eq. ([T7|: 




P =Y.P (h[^\x\i) n P {Pi 

I'M 






= p 


(nG\x,x) 5] ■ (29) 


Testing for tCW signals only, this yields an odds ratio 


O 


tS/GLtL - X! °tS/G 




(30) 


Ol/g e 


E- 

XI 


OtL/G ® 


OstS/GLtL — I Os/G 0'^+ E °tS/G 


ZW(iV.eg-l).F(«) 


(32) 








xe 


The corresponding generalized Bayes factor follows by 
introducing additional prior-weight variables in analogy 
to from Eq. (21): 


PS = P 


(Ps 


Psts,P] = 


Os/G 


Os/G + OtS/G 


N^, 


e=i 


= (1 -pts) = 1^1 - E 

for persistent signals and 

Psts,p) = 


~e 

Pts 


(33) 


fts = P {Pts 


tS/G 


Os/G + OtS/G 


(34) 


for transient signals. 

This persistent-or-transient ‘CW-like’ robust detection 
statistic is then given by 


B. 


StS/GLtL 


(l-pts)e^ + EPtse^+('^“*-'>^* 


( 0 ) 


(1 - PLtL) ' +EPl +Ep^l 

X xe 


(0) • 


(35) 


As was the case for Bs/gul from Eq. (22), additional 
freedom in tuning this statistic could be obtained from 
different amplitude-prior cutoffs in "Hl , XLth and now also 


Tis and Tits ■ But again we restrict ourselves to using the 
same cutoff, resulting in a single tuning parameter , 
and use only the set of prior variables {psiPtsP'^e'LiP^} 
as weights for the various contributions. 

Next, we consider the logarithm of this Bayes fac¬ 
tor, splitting numerator and denominator separately into 
sums of a dominant term and a logarithmic correction. 


which generalizes Eq. (23): 


InBs/GLtL — A„iax + In I ^ f 

\e^E 

f^max In I ^ ( 


E-E„ 


.D-D„ 


KDeT) 


( 36 ) 




























V. TESTS ON SIMULATED DATA 


where -Dmax is the maximum of the same set of denom¬ 
inator exponents given in Eq. (241 and Emax = max £ is 
the maximum of the numerator exponents: 


£=[f + Inps, + (A'.eg - + Inpfs} ■ (37) 

For transient signals and disturbances that are in¬ 
deed limited to a single segment (or reasonably close), 
it should suffice to compute an approximate Bayes factor 
using only the maximum single-segment contributions: 


B. 


StS/GLtL ~ e + PtS ® ® ) 


(l-PLtL)eV(“' 




(38) 


X 


Xm(X) + 




where m is the segment number with the largest multi¬ 
detector contribution, so that e'^’” = max^ ^"^0 ’ 
and m(A) is the analogous segment number for each de¬ 
tector: =max,(p«e^"'). 


As in Eq. (261, we also define an ad hoc modified 
‘loudest-only’ detection statistic where we use only in¬ 
formation from one segment m with the highest multi¬ 
detector T™ = 


In this section, we present some tests of the new Bayes 
factors )3s/GLtL and tSsts/QHL in the form of injection 
studies on simulated data, where simulated CW and tCW 
signals (‘injections’) are recovered from simulated noise. 
We use the same basic injection procedure and detection 
criteria as described in Sec. VIIB ofjPapeFTj 


A. Search setup and data sets 

For two reasons, it is important to test these detection 
statistics with realistic data and a search setup that is 
close to what is used in practice: First, the approach in 
this paper is to provide a simple extension of the estab¬ 
lished search codes that already produce the .F-statistic 
and line-robust statistics, which should be directly appli¬ 
cable in current search efforts, and hence tested in similar 
circumstances. Second, as we are interested in transient 
features, the time-domain characteristics of real data sets 
are important for any performance demonstration, espe¬ 
cially the occurrence of gaps in the data: it is necessary 
to test that gaps do not lead to persistent CW signals be¬ 
ing rejected, or to a smaller improvement in sensitivity 
towards tCW signals than in perfectly continuous data. 

Hence, we use fully simulated data, but with realistic 
duty factors taken from the real LIGO S6 data. One 
data set contains pure Gaussian noise, whereas an addi¬ 
tional transient non-Gaussian disturbance is present in 
the second data set. 


-“StS/GLtL.lo = [PS e -\-Ptse \ e I , j 






(39) 




X / 

Again this requires empirical tests to verify that it is 
close in efficiency to the full Bayes factor, which will be 
demonstrated in Sec. [V] 

Alternatively, in a search for tGWs only, or for GWs 
and tGWs with two separate toplists, one could use the 


Bayes factor corresponding to Eq. (30): 




-BtS/GLtL — 2_^PtS 6' 
£ 


((1 - Pul)' 




( 0 ) 


+ EPLe^ 


;X£ + 


■ (40) 


X 


X£ 


All these expressions also simplify significantly if all 
Ojg/G = Os/G s-nd = X/G’ ’'^hich we assume for 

most of the test cases in the next section. 


1. Search setup 

Our search setup mimics the Einstein@Home [46] 
‘S6Bucket’ search m on LIGO S6 data: we use data 
spanning about 255 d, analyzed semicoherently with 
Nseg = 90 segments of Tgeg = 60 h. 

The analysis is performed with the 
HierarchSearchGCT code [25], a semicoherent Stack- 
Slide implementation based on the GOT method of 
Ref. mi. We use the same search grids as the S6Bucket 
search, covering the whole sky and only the Hrst-order 
spin-down parameter /. HierarchSearchGCT is lim¬ 
ited to semicoherent refinement in spindown only (by 
a factor yj.) but not over the sky, a limitation that 
has been identified as an important point for future 
improvement. Hiisn] 

The search output is a toplist of the most significant 
candidates ranked by one of the semicoherent statistics 
iF or Hs/GL- For this study, we have modified the code 
to also return the single-segment and F^^-statistics 
for each toplist candidate. 

We first analyze a 50 mHz band of each simulated 
noise-only data set (purely Gaussian and Gaussian + 
transient disturbance), and obtain the maximum of each 
detection statistic over the whole sky and /, / range. 
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Then, for a set of fixed signal strengths ho, CW or tCW 
signals with otherwise random parameters are injected 
into the same noise realization, and searched for again 
over a smaller search box. This is a subset of the original 
search grid containing (but usually not centered on) the 
injection point. A signal is considered as detected if the 
highest value from this search box exceeds the maximum 
value from the pure-noise search. 

The search parameters for both the full-band noise- 
only search and for the smaller injection search boxes 
are given in Table |l] In all test cases, 1000 signals are in¬ 
jected per ho value, with a range chosen so that detection- 
efficiency curves are well-sampled over the whole range 
from 0 to 1. The signals are drawn with random ampli¬ 
tude parameters cost, /po, 'h; and with /, / and sky po¬ 
sition randomly distributed over the full search range as 
given in Table |l] The distribution of tCW-specific time- 
domain parameters is discussed below in Secs. |V CHV P] 


two toplists with 1000 candidates each. One toplist 
is sorted by T and one by the pure line-veto statistic 
i3s/L(o^j^/G “ '''^hich corresponds to Bs/GL in the 

limit of —)■ — cxD. All other detection statistics are 

then computed from the union of these two toplists. 

In principle, this procedure could lead to some noise 
outliers or some injections being missed for the ‘recom¬ 
puted’ statistics. However, the two toplists (classic T- 
statistic and pure line-veto statistic) are very ‘orthogo¬ 
nal’ in the sense that one is nearly optimal for Gaussian 
data and one is tuned towards strong disturbances, so 
that candidates that would be significant by one of the 
other Bayes factors are very likely to appear in at least 
one of these two toplists. Also, tests with longer toplists 
have found that this approach is generally sufficient to 
not lose any would-be high-significance candidates of any 
recomputed statistic by having them below the threshold 
of both ranking statistics. 


Another point where we construct our procedure in 
analogy with the SfiBucket search is the ranking of can¬ 
didates in the toplists kept by the HierarchSearchGCT 
code. For each search job (51 sky partitions per noise- 
only search, or one search box per injection) we keep 


Common search parameters 

Detectors 

LIGO HI, LI 

^start [s] 

949469977 

^end [s] 

971529850 

^seg, NtJg 

90, 89, 90 

Tseg 

60h 

Frequency resolution 5f 

« 1.6143 X 10"® Hz 

Spin-down resolution Sf 

« 5.7890 X 10"^^ Hz2 

f refinement factor 

230 

Nominal sky-grid mismatch 

0.3 

Original toplists 

F and Bs/o{d^/Q = 0.5) 

Toplist length 

1000 

Full-band search parameters 


min/ 

50.0 Hz 

Frequency range A/ 

0.05 Hz 

min/ 

Ri -2.6425 X 10-®Hz2 

Spin-down range A/ 

Ri 2.9067 X 10"® Hz® 

Sky points Asky 

707 

Search jobs (sky partitions) 

51 

Purely Gaussian data max2J^ 

6.374 

Transient-line data max 2F 

11.985 

Persisent-line data max 2iF 

42.246 

Per-injection search box 


/ range 
/ range 

Sky points 

0.001 Hz 
« 2.3156 X 10"“ Hz® 
(4 coarse-grid points) 
10 


TABLE I. Search parameters for pure-noise (full-band) and 
per-injection searches with lalapps_Hi6rarchSearchGCT. 


2. Simulated data sets 


To generate our data, we used the duty factors of the 
HI and LI detectors for the data selection of the Ein- 
stein@Home SfiBucket search on LIGO S6 data: this 
gives us 6156 Short Fourier Transforms {SFTs) in HI 
and 5924 SETs in LI, each SET 1800 s long, with realis¬ 
tic gaps in between. 

The data selection method [IS] used to generate the 
S6Bucket segment list was optimized for total sensitivity 
and did not ensure uniform duty factors over segments 
and detectors. Hence, it happens to have two particularly 
unequal segments, where one detector contributes no or 
very little data (compared to an average of 67 SETs per 
segment and detectors): segment 64 (of 90) has no data 
from detector HI, and segment 76 has only four SETs 
from detector LI. 

The first ‘quiet’ data set is pure simulated Gaus¬ 
sian noise, from the Makef akedata_v5 code [48], 
with the sensitivity of the two detectors be¬ 
ing realistically slightly unequal: the single-sided 
PSDs are = 3.2591 x lO-^^ Hz"^/^ and 

= 2.9182 X 10-22 Hz-1/2^ 

The per-detector normalized SET power 




SFT 


(/)^ 


A^sft 

—?— y 

A^sft Tsft “ 

Oc—1 


-if)\ 


y(/) 


(41) 


for this data set is shown in Fig. [l] both as a frequency- 
dependent average 'Psft(/) whole data set and 

in the form of a single-segment maximum max^T^^^rp 
over SFT frequency bins, as a function of segments £. 
The apparent outlier maxfVgp^'^^ « 3.9 is just an ef¬ 
fect of low-number statistics, as segment 76 contains only 
four SFTs from detector LI. 
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FIG. 1. Pure Gaussian noise data set. Top panel; normal¬ 
ized SFT power 'Pgprj, averaged over all N^eg = 90 segments. 
Bottom panel: single-segment rasafV^^, maximized over 
SFT frequency bins, as a function of segments £. 


We have also generated a second data set contain¬ 
ing a transient single-detector disturbance. We started 
with an independent realization of Gaussian noise with 
the same time stamps and PSDs as the first set, 
and then used the otherwise equivalent implementation 
Makef akedata_v4 [35] to inject a stationary line fea¬ 
ture with fixed amplitude /iql = 4 x 10“^^ and frequency 
/l = 50.025 Hz in a single detector (HI) during a single 
segment £i^ = 10. A transient line in a single segment is 
chosen because, as discussed in the introduction, most 
strong disturbances in LIGO S5 and S6 data are indeed 
either persistent over the whole observation time, or over 
only a single segment. [551 ISO] 

The normalized SFT power for this data set is shown 
in Fig. We see that the disturbance produces a very 
high single-segment max/ « 15. It is also strong 

enough to show up in the average T’^rp(/) over the whole 
data set, but in this average it is much weaker than the 
persistent lines studied before (cf. Fig. 7 of Paper I). 

This simulated disturbance is similar to a family of 
transient disturbances in LIGO S6 data informally called 


® As of the writing of this paper, the newer MFD_v5 code did not 
support stationary line injections. 


FIG. 2. Data set with Gaussian noise and a single-detector 
stationary line injected for the duration of segment £ = 10 in 
detector HI. Panels are the same as in Fig. [7] 


‘pizza-slice disturbances’ |3^ due to their shape in three- 
dimensional plots of J^-statistics against frequency / and 
spin-down /. Fig. [^presents such a plot for our simulated 
data set. Though a sharp line in semicoherent 

search sees this transient disturbance as a wide structure 
in parameter space. Different templates match the dis¬ 
turbance at different times, leading to the ‘pizza-slice’ 
shape. The simulation result is somewhat narrower than 
the typical LIGO S6 ‘pizza slice’, since its duration is 
a whole segment of Tgeg = 60h, while the corresponding 
disturbances in S6 data typically last only for a few SFTs. 

We have also generated a third data set with the same 
procedure as the second, but with the single-detector 
disturbance active over the whole observation time, i.e. 
as a persistent line. In such a case, the new transient- 
optimized Bayes factors Hs/glil and Hgts/GLtL cannot 
be expected to yield further improvements over the de¬ 
tection efficiency of the persistent-line-robust statistic 
Hg/GL- Still, we have verified that in this case there are 
no losses either compared to Hg/QL, with both new Bayes 
factors reproducing the performance found for Hg/QL in 
and improving over the standard JF-gtatistic. To 
avoid redundancy with that paper and the purely Gaus¬ 
sian case (data set 1), this set of results is not shown and 
discussed in detail here, our focus being instead on the 
cases where improvements can be made. 


Paper I 




















































11 



FIG. 3. Data set with Gaussian noise and a single-detector 
stationary line injected for the duration of segment £ = 10 
in detector HI. The figure shows the average multi-detector 
semicoherent 2J^-statistic, over 90 segments, with the ‘full- 
band’ search parameters listed in Table as a function of 
frequency / and spin-down /. 


3. Tuning the free parameters of the line-robust statistics 

A global value for the transition-scale parame¬ 
ter is determined through requiring safety in 

quiet data, choosing the minimum value required 
to have negligible differences in detection probability 
to the J^-statistic. This statement holds down to 
the false-alarm level probed by this study, which is 
bounded by the inverse number of fine-grid templates 

(A^tempi = 7r-^sky ^ ^ « 2.3 X 10^^^ in the search 
setup, but is effectively somewhat higher due to template 
overlap. 

For the original i?s/GLi we re-use a value of 

« 3.027 found in more extensive studies on LIGO 
S6 data m- The present injection study on the pure 
Gaussian-noise data set is sampled in steps of 0.1 in 
which for i?s/GLtL and Ssts/GLtL leads to a value 
of « 3.0. The difference is negligible with respect to 
the sampling accuracy of 1000 injections per hg value, as 
shown by the results in the next subsection. 

For the per-detector line priors, we do not take into 
account our privileged knowledge from generating the 
data sets, instead testing the robustness of the detection 
statistics by a simple choice of dp/Q = = 0.001 for 

all X and £ in both data sets. This corresponds to the 
lower truncation suggested in section VI. A of [Paper I| 
as a conservative choice that considers lines as rare, but 


still keeps the line hypothesis open in case it is strongly 
preferred by the data. 

However, we also investigate the effect of setting 
= 0.0 in the two segments with no or small con¬ 
tributions from one of the two detectors. The rationale 
for this modification is that the single-segment signal hy¬ 
pothesis "Hfs of Eq. (27) becomes indistinguishable from 
our transient-line hypothesis 77^^ of Eq. (14) when that 
segment is completely dominated by a single detector. 

For any future searches of LIGO data using these 
statistics, tuning of both the transition scale and the 
per-detector line-priors will be revisited using the specific 
search setup and data characteristics. 


B. Persistent CW signals 

For persistent GW signals, the injection procedure is 
identical to that in jPaperl] Fig. a shows results in the 
form of detection probabilities pdet for the various statis¬ 
tics as functions of the scaled signal amplitude hg/y/S. 



/io/Vd[l/Hz] 


FIG. 4. Detection efficiency Pdet for persistent CW signals, 
as a function of scaled signal amplitude ho/y/S, for the fol¬ 
lowing semicoherent statistics: ^pv (permanence veto), 

Bs/GL from Eq. Sg/GUL from Eq. and SgtS/CLtL 

from Eq. | |35| |. The dashed horizontal lines mark pdet = 90%. 
Top panel: injections in pure Gaussian noise. Bottom panel: 
injections in Gaussian noise with a transient disturbance. Sta¬ 
tistical uncertainties are smaller than the plot markers. 
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As discussed in the previous subsection, tuning 
= 3 allows both i?s/GLtL and -Bsts/GLtL to match al¬ 
most perfectly the detection efficiency of the ^-statistic 
and of Bs/GL in quiet Gaussian data, with maximum 
discrepancies in p^et of 1% (down to the false-alarm 
level of this search setup). These are smaller than the 
statistical uncertainties from 1000 injections, and could 
be resolved with a more detailed tuning. In this 
case, all statistics reach 90% detection probability at 
0.023. 

In the data set with a transient-linelike single-detector 
disturbance, T performs much worse, while Ss/GL loses 

a few % of pdet at any given /iq. Here, the new i?s/GLtL 
performs best with no degradation from the quiet case, 
still achieving j^fS « 0.023. Taking into account 
the possibility of tCW signals (which are not actually 
present in this case), Hsts/GLtL only sacrifices about 1% 
in pdet, and still improves significantly over Hs/gl- 


In both cases, using the simplified ‘loudest-only’ detec¬ 
tion statistics from Eqs. (261 and (39) with only one set of 
single-segment values (with m chosen so 

that = maxf JF^') does in fact not decrease detection 
efficiency. No extra curves are plotted for these statistics. 

Also, we see that the performance of the permanence 
veto [21H1S1 in the absence of tCW signals is closely re¬ 
produced by our new Bayes factors. 


C. tew signals of exactly one segment length 



/io/V5[l/Hz] 



FIG. 5. Detection efficiency p^et transient tCW signals 
with Tinj = 7"seg = 60h, randomly distributed over segments, 
for the same statistics as in Fig.[^ The dashed horizontal lines 
mark p^et = 90%. Top panel: injections in pure Gaussian 
noise. Bottom panel: injections in Gaussian noise with a 
transient disturbance. 


For the first set of transient signal injections, we sim¬ 
ulate CW-like signals that are active during exactly one 
segment, i.e. with fixed Tinj = T^eg = 60h and a start 
time corresponding to that of a randomly picked segment 
for each injection. Though not realistic, this configura¬ 
tion is useful as a first test of principle, where the as¬ 
sumptions made in the derivation of Sec. |IV| correspond 
exactly to the data, before generalizing the test to a more 
realistic signal population with varying transient dura¬ 
tions in the next section. Detection probabilities for this 
case are shown in Fig.[^ over both noise data sets (purely 
Gaussian and Gaussian -|- transient disturbance). 

Tim established semicoherent detection statistics T 
and Hs/gl achieve /ig°^/\/^ ~ 0.1 in the first, quiet data 
set. This is about a factor 4-5 worse than for persistent 
signals, which is actually already a smaller ratio than 

expected from the naive = ^/N^ scaling for a 

fully coherent search, but consistent with a more detailed 
StackSlide sensitivity estimation m- 

When going from the purely Gaussian to the transient¬ 
line data set, the performance of T and Hg/GL decreases 
somewhat more strongly for these tGW signals than it did 
for persistent signals, with Hg/GL losing up to 10% in pdet 


at some hg values and increasing to j'/S « 0.12. 

Considering the permanence veto, we confirm that it 
would effectively remove almost all of these tCW signals, 
and hence we indeed need an alternative detection statis¬ 
tic for this case. 

The Bayes factor Hs/GLtLi which adds to i?s/GL only 
the possibility of transient single-detector disturbances 
(such as that in the second data set), but not of the multi¬ 
detector-coherent transient signals we are now injecting, 
was found before to be safe for persistent CW signals. 
Now it turns out to be much safer for tCWs than the 
permanence veto, but still performs worse than Hg/GL in 
both noise data sets, with /\fS « 0.17. Hence, this 
is not a particularly safe detection statistic for tCWs. 

On the other hand, the full transient-signal-aware 
Hgts/GLtL yields a significant increase in detection effi¬ 
ciency over Hg/GL? even in the second data set where a 
transient single-detector disturbance and transient sig¬ 
nals are present together. It achieves /\/S « 0.08 in 
both data sets and yields up to 35% improvement in pdet 
for weak signals below this threshold. This is also consis¬ 
tent with the expectations for a StackSlide search when 
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taking into account the additional mismatch accrued by 
semicoherent statistics. [50l El] 

Again, there are no losses with the simplified ‘loudest- 
only’ detection statistic i?sts/GLtL,io only using the seg¬ 
ment m with = max£ (not plotted separately). 

However, a minor problem with Ssts/GLtL is easy to 
overlook in Fig. Even at very high /ig, where other 
detection statistics eventually reach p^et = 1, it misses a 
few signal injections, typically about l%-2%. 

We found that all missed injections are related to 
the previously-mentioned peculiarities of data selection, 
falling into segments 64 or 76, where one of the detectors 
contributes no or very little data. 

Here, the single-segment hypotheses and Hfg 

become indistinguishable, and the normalization of 
Hsts/GLtL with the given tuning values is such that it 
will veto any strong outlier from these segments. 

This issue is not a fundamental problem with our ap¬ 
proach, as the data selection for future semicoherent 
searches can easily be constrained to avoid such anoma¬ 
lous segments, making the search better suited for tCW 
detection without risking efficiency for persistent CWs. 

As a simple work-around for the given data selection, 
we can simply set ’ = 0.0 while keeping all other 

^u!/G ®Oual values. This makes Pdet(5sts/GLtL) go 

to 1 for high hg, just as Bs/gl does, and sacrihces only 
1-2% of pdet at lower hg, and a similar small amount in 
the persistent-CW case - which could also be recovered 
by slightly changing the 77^°^ tuning. 



ho/^S[l/Bz] 



ho/\/5[l/Hz] 


FIG. 6. Detection efficiency for transient tCW signals 
of random duration Tinj S [0.5, 2.0] X Tgeg and randomly dis¬ 
tributed over the whole observation time. Here, the signal 
amplitude hg has been rescaled for each injection to compen¬ 
sate for the varying Tinj; see the text for details. Detection 
statistics and panels are the same as in Figs, jljj^ 


D. tew signals with varying duration 


The pragmatic signal model for transient CW-like sig¬ 
nals from Eq. (27) explicitly assumes a signal lining up 
with a single segment, as tested in the previous section. 
As such an alignment is not very likely in nature, it is 
interesting to test the robustness of the new detection 
statistic against deviations from this assumption. Hence, 
we have also tested injecting transient signals with ran¬ 
dom lengths Tinj and with random start times uniformly 
drawn from [tstart,iend - Tlnj]. 


For better comparison with Fig.[^ the strength of these 
signals is scaled according to ho cx y Tseg/Tinj, so that the 
average signal-to-noise ratios (SNRs) for each nominal 
ho value are the same. (But note that the per-injection 
SNRs are not hxed, as they still depend on the three 
randomized amplitude parameters.) 

As an example, we show the results for the Gaussian- 
noise data set, and for Tjnj uniformly drawn from the 
interval [0.5, 2.0] x Tgeg = [30,120]h, in Fig. Here we 
find a general decrease in detection efficiency for all 
statistics considered, with T and Bs/GL only achieving 
h^o°'^°/VS « 0.14 instead of « 0.1 in the previous test and 


Hsts/GLtL achieving hQ°‘^°/VS « 0.1 instead of « 0.08. 
This is mostly due to the fact that now a significant frac¬ 
tion of injections falls completely or partially within gaps 
of the simulated data set, or within parts with very low 
duty factor, thus decreasing the effective SNR for any 
detection method. 

The main hnding here is that the transient-aware 
Bayes factor Ssts/GLtL still improves over the other 
statistics, and that this is still true even for the ap¬ 
proximate ‘loudest-only’ version. This indicates that, 
though the initial assumption of a tCW signal that ex¬ 
actly matches the duration of a single segment seemed 
quite strict and arbitrary, this simple approach is in fact 
useful for a wider range of tCW durations. 

We also observe that the permanence veto is not quite 
as strict in falsely vetoing these tCW signals as it was for 
Tinj = Tsegj as some fraction of them that overlaps with 
more than one segment can now still contribute to the 
reaveraged detection statistic. Still, it is not competitive 
with any of the other tested detection statistics, which is 
no surprise, since it was not constructed to accept tran¬ 
sient signals in the first place. 
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VI. CONCLUSIONS 


In this paper, we have considered J^-statistic-based 
semicoherent searches for CWs on data that also contain 
transient instrumental disturbances or signals that are 
transient, but also CW-like (quasimonochromatic). We 
have generalized the Bayesian model-selection approach 
of Refs, [m (HU |40] with explicit models for transients 
lasting for a single segment in a semicoherent search, in¬ 
cluding single-detector disturbances and multi-detector- 
coherent signals, and demonstrated that the resulting 
statistic can be effective even for transients of different 
durations. 


For this demonstration, we injected simulated CW and 
tCW signals into several simulated data sets, using real¬ 
istic duty factors corresponding to the two LIGO detec¬ 
tors during their sixth science run. We have shown that 
the new detection statistic i^sts/GLtL is rnore robust than 
standard semicoherent statistics towards both persistent 
and transient single-detector disturbances in the data, 
while not hurting sensitivity to persistent CW signals, 
and that it is more sensitive towards transient signals. 


We found that -Bsts/GLtL works best if the transient 
signals or disturbances indeed last for exactly a single 
segment of the semicoherent search. But it still yields 
significant improvement over standard methods for tran¬ 
sient durations shorter or longer than one segment length. 

Though the injection studies showed a minor issue with 
Ssts/GLtL dismissing a small number of strong tCW sig¬ 
nals, this was found to be related solely to a peculiarity 
of the data selection used in our tests. It can be worked 
around by prior tuning, while for future combined CW- 
tCW searches the issue can easily be avoided by a prop¬ 
erly constrained data selection. 


Just as with the line-robust statistic B 


S/GL 


of 


Pa¬ 


per I, the new detection statistic i?sts /GLtL only requires 
quantities that are already computed in any semicoher¬ 
ent J^-statistic search: namely the single-detector and 
single-segment J^-statistics. Hence, computational cost 
is only increased by the arithmetic operations in comput¬ 
ing Hgts/GLtLj as given in Eq. (23). Our injection studies 
showed that this is usually dominated by a few terms, and 
that good sensitivity can already be obtained by comput¬ 


ing Hgtg/QLtL only for the most significant candidates 
obtained from other statistics. Hence, these results allow 
for a CW search with increased robustness to transient 
disturbances and increased sensitivity to transient signals 
as a computationally cheap ‘add-on’ to existing searches, 
such as the Einstein@Home project [iH] , 


The present approach could be further generalized to 
allow for an explicit classification of periodic data signa¬ 
tures into several classes (persistent CW signals, persis¬ 
tent lines, transient disturbances, and ‘transient-CW’ sig¬ 
nals), with transient lengths of any multiple of a segment 
length, through the Bayesian Blocks algorithm [S3] . 


Multi-detector transient disturbances cannot be safely 
distinguished from transient astrophysical signals by con¬ 


sidering the per-segment and per-detector J'-statistics 
only. However, for widely separated detectors these are 
much rarer than single-detector disturbances, and hence 
it should be possible to investigate any coincident tran¬ 
sient candidates with more detailed analysis of their fre¬ 
quency evolution and coherence with auxiliary channels. 

As this work shows how to increase tCW sensitivity by 
simple modifications to existing CW searches, other more 
dedicated search methods might be more powerful, but 
not without significant computational effort, so that the 
present results will find practical applications regardless. 
Still, sensitivity comparisons of tCW detection methods 
between this work, the dedicated tCW detection statistic 
of Ref. |3l| and the more generic ‘long-transient’ excess- 
power method of Ref. [33| would be of great interest for 
the development of optimal overall search strategies. 

Such a comparison will, however, require significant 
further work, expertise from different sub-fields of GW 
data analysis, and great care in ensuring a comprehen¬ 
sive and fair evaluation of each approach under equal 
circumstances. Hence, it would be an interesting oppor¬ 
tunity for a community mock-data challenge modelled 
after a recent study [51] of methods to detect CW sig¬ 
nals from the binary source Sco-XI, which also compared 
CW-optimized and generic search methods. 
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Appendix A: ‘cheat sheet’: 
pictorial representations of Bayes factors 

In this appendix, we provide a simple pictorial rep¬ 
resentation of the various detection statistics (Bayes 
factors) derived from Bayesian hypothesis testing in 
Refs. [lU HO] and in this paper. 

Here we consider the simplest example case that still 
allows distinguishing all of our hypotheses "Hq, B-s, Bl, 
Bts and Bti,: this is the case of two detectors A = 1,2 
and two data segments £ = 1,2. We then represent with 
□ any hypothesis that posits pure Gaussian noise in the 
single-detector, single-segment data subset Al¬ 

ternatively, ■ depicts any hypothesis that posits a quasi¬ 
monochromatic signature, be it a signal or a disturbance. 
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in (X,£). 

We then build up sketches of the full semicoherent hy¬ 
pothesis by arranging the two detectors on the horizontal 
axis and the two segments on the vertical axis. 

This way, the signal-vs-Gaussian Bayes factor from 
Eq. Q, which is equivalent to the .F-statistic [40], can 
be represented (up to proportionality, corresponding to 
global priors) as 


statistic from Paper I reads as 


Bs/l{x) fx p. ■ □ 
V ■ □ 


x) + Pi^'\x) ’ 


(A2) 


and the more general line-robust statistic, reproduced 
here in Eq. (131, is 


Bs/g{x) oc oc 


p 

Bs 

x,I^ 

p(::ia:) 

p 

^Hg 

x,l] 

□ □ 

□ □ 


• (Al) 


^s/GL(a^) oc 




(A3) 

In this paper, we have generalized the approach of|Pa-| 


Using the same sketch notation, the pure line-veto per I to yield 


(i) a detection statistic that takes into account transient lines in any single-detector, single-segment subset as an 
additional noise component in the denominator, given in Eq. (22), and that we can sketch as 


Bs/Ghth{x) oc 


p(::ia.) 


p ( on 
^ y n n 


®) + -P(5nla^) + ^’(R5l®) + ^(SRI®) + -P(R 


^ \ I 7-) / h 


(A4) 


(ii) another detection statistic, given in Eq. (35), that also takes into account ‘transient-CW’ signals in the numerator, 
which as a sketch looks like this: 


i?Sts/GLtL(*) OC + 


P{ 


^)+Pi^ 




a;) + P( S g I a;) _ 

a:) + P(H-|a:)+P(gH 


x)+P{^^\x)'^ 

(A5) 


(iii) a pure transient-CW detection statistic, as given in Eq. (40), and with the following sketch form: 


5ts/GLtL(a:) oc 


p C □ □ 

^ \ n n 


x) 


+p{:^\^)+pi^ 


^(55l^) + ^(gg 

: I ^)+p{ 


x) 


_ □ 

n n 


-CPIS 


, 7-) / n n 


(A6) 
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